CDS

Accession Number TCMCG075C03300
gbkey CDS
Protein Id XP_017969410.1
Location join(31723800..31723988,31724389..31724526,31724989..31725070,31725432..31725508,31726734..31726840,31726985..31727105,31727817..31727933,31728035..31728126,31728336..31728402,31728546..31728624,31728715..31728843,31729604..31729743,31730147..31730287)
Gene LOC18613653
GeneID 18613653
Organism Theobroma cacao
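
The Location field above is a join of 13 exon intervals with 1-based, inclusive coordinates. As a consistency check, the summed interval lengths should equal three times the protein length plus one stop codon. The sketch below assumes the location string is copied verbatim from this record; the helper name `parse_join` is hypothetical and not part of any library.

```python
import re

# Location string copied from the record above (1-based, inclusive coordinates).
LOCATION = (
    "join(31723800..31723988,31724389..31724526,31724989..31725070,"
    "31725432..31725508,31726734..31726840,31726985..31727105,"
    "31727817..31727933,31728035..31728126,31728336..31728402,"
    "31728546..31728624,31728715..31728843,31729604..31729743,"
    "31730147..31730287)"
)


def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string.

    Hypothetical helper: handles only plain 'start..end' intervals, not
    complement(), partial (< or >) or single-base locations.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(LOCATION)

# Inclusive coordinates, so each interval spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in exons)

# One codon is the stop codon, so residues = codons - 1.
residues = cds_length // 3 - 1

print(len(exons), cds_length, residues)
```

For this record the spliced CDS is 1479 nt, which matches the 492 aa protein plus a stop codon.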

Protein

Length 492 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018113921.1
Definition PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X6 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category T
Description Heparan-alpha-glucosaminide N-acetyltransferase-like
KEGG_TC -
KEGG_Module M00078
KEGG_Reaction R07815
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K10532
EC 2.3.1.78
KEGG_Pathway ko00531, ko01100, ko04142, map00531, map01100, map04142
GOs -

Sequence

CDS:  
ATGTCAACTCTAATCACGATAGCGGAAGAACAACGACAACCTCTTCTTCTCGATCCTTCCCCTGACGGAAACGAAGAAGAGATCGCCGCTTCCTCATCATCGAACGGACCAGATGCCCCTAAACTTACTCTCGACGACTCTAATCAACGGCTCCTATCTCTCGACGTATTCCGTGGCCTCACCGTCGCGTTGATGATTTTGGTTGATGATGCTGGAGGGGCTTTTCCATCTATCAATCACGCTCCATGGTTTGGTGTGACAATCGCCGATTTCGTGATGCCCTTTTTTCTTTTTTGTGTTGGGGTCTCTATTAGCCTTGTATTTAAGAAATCTTCTAGCAAAACATTGGCTACAAAGAAAGTTATATTGAGGACGATCAAACTTTTCCTTCTGGGCTTGTTTCTACAAGGCGGGTATTTTCATGGGCGTGACAATTTAACATATGGAGTTGATGTGGTCAAGATACGATGGCTAGGTGTACTACAGAGGATATCAATTGGATATCTGCTGGCTTCAATATCAGAAATCTGGCTTGTTTACAATGTTGTGGTTGACTGTCCAACAGCATTTGTTAGGAAATATCATGTTCAGTGGATTGTTGCTGCCCTACTATTATCATTTTACATGTGCTTGCTCTATGGCCTTTATGTTCCAAACTGGGAATTTCAAGCTCCAAGCCTGAATCTGTCCACTAATGGATCCCATACTCAAATTGTGCACTGTGGAGTCAGGGGGAGCCTTGAACCTCCATGCAATGCAGTTGGCTACATCGATCAGTATTTTCTGGGTGAACAGCATCTATATCAACGTCCTGTTTATAGAAGAACAAAGGAATGCAGTGTCAATTCTCCTGACTATGGGCCTCTGCCACCGGATTCACCTGAATGGTGCCTTGCACCCTTTGACCCTGAGGGCATTTTAAGTTCATTAATGGCTGTCCTCACTTGTTTTGTGGGATTGCATTTTGGACATGTACTTCTGCATTATAAGGGACAAATGCAGAGAGCACTTTTATGGTCCATGTCTTCCTTTCTGTTGCTAGTTTCAGGATTTGGATTAGAGATGCTAGTTTGTTTTGTACAATTGATGGCTGGTAGGTGTAACAGGCATTCCTCTCTCCAAACCACTGTACACATTGAGCTATATGTGCATCACTGCTGGAGCATCAGGCTTGTTCTTAACCATTATCTTCTACATAATGTCAAACATTTTAGAAAGCCTGTGGTGTTACTTCAGTGGATGGGAATGAATGCTCTCATCGTATATGCTTTGGCTGCTTGTGACATTTTCCCAGCGGCTGTGCAAGGTTTCTATTGGCGTTCACCGGAAAATAACTTGGTTGATGGTATGGAATCATTGCTACAGGCCATGCTTCATTCAAGCAAGTGGGGTACCCTCGTATTTGTATTGCTCCAGATCTTATTTTGGTGTCTTGTTGCCGGTTTTCTCCACATGAAAGGCATATATATAAAACTCTAG
Protein:  
MSTLITIAEEQRQPLLLDPSPDGNEEEIAASSSSNGPDAPKLTLDDSNQRLLSLDVFRGLTVALMILVDDAGGAFPSINHAPWFGVTIADFVMPFFLFCVGVSISLVFKKSSSKTLATKKVILRTIKLFLLGLFLQGGYFHGRDNLTYGVDVVKIRWLGVLQRISIGYLLASISEIWLVYNVVVDCPTAFVRKYHVQWIVAALLLSFYMCLLYGLYVPNWEFQAPSLNLSTNGSHTQIVHCGVRGSLEPPCNAVGYIDQYFLGEQHLYQRPVYRRTKECSVNSPDYGPLPPDSPEWCLAPFDPEGILSSLMAVLTCFVGLHFGHVLLHYKGQMQRALLWSMSSFLLLVSGFGLEMLVCFVQLMAGRCNRHSSLQTTVHIELYVHHCWSIRLVLNHYLLHNVKHFRKPVVLLQWMGMNALIVYALAACDIFPAAVQGFYWRSPENNLVDGMESLLQAMLHSSKWGTLVFVLLQILFWCLVAGFLHMKGIYIKL
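
The CDS and Protein sequences above are related by the standard genetic code. The sketch below translates two short excerpts copied from the CDS (its first 42 nt and its last 30 nt) and compares them with the two ends of the Protein sequence. It assumes the standard nuclear code (NCBI translation table 1); the function name `translate` and the excerpt constants are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS in this record; both are in frame because the
# full CDS length is a multiple of three.
CDS_START = "ATGTCAACTCTAATCACGATAGCGGAAGAACAACGACAACCT"
CDS_END = "CACATGAAAGGCATATATATAAAACTCTAG"

print(translate(CDS_START))
print(translate(CDS_END))
```

The two results correspond to the first 14 and last 9 residues of the Protein sequence. The same function can be applied to the full CDS string to check the whole translation.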